Reinforcement Learning for Biped Robot
نویسندگان
چکیده
Animal rhythmic movements such as locomotion are considered to be controlled by neural circuits called central pattern generators (CPGs), which generate oscillatory signals. Motivated by such a biological mechanisms, rhythmic movements controlled by CPG has been studied. As an autonomous learning framework for the CPG controller, we propose an reinforcement learning method , which is called the CPG-actor-critic method. We apply this method to the reinforcement learning for the biped robot. The computer simulation shows that our method is able to train the CPG such that the biped robot walks stably. We also examine the characteristic of this CPG controller.
منابع مشابه
Episodic Reinforcement Learning Control Approach for Biped Walking
This paper presents a hybrid dynamic control approach to the realisation of humanoid biped robotic walk, focusing on the policy gradient episodic reinforcement learning with fuzzy evaluative feedback. The proposed structure of controller involves two feedback loops: a conventional computed torque controller and an episodic reinforcement learning controller. The reinforcement learning part inclu...
متن کاملDynamic Control Algorithm for Biped Walking Based on Policy Gradient Fuzzy Reinforcement Learning
This paper presents a novel dynamic control approach to acquire biped walking of humanoid robots focussed on policy gradient reinforcement learning with fuzzy evaluative feedback . The proposed structure of controller involves two feedback loops: conventional computed torque controller including impact-force controller and reinforcement learning computed torque controller. Reinforcement learnin...
متن کاملPoincaré-Map-Based Reinforcement Learning For Biped Walking
We propose a model-based reinforcement learning algorithm for biped walking in which the robot learns to appropriately modulate an observed walking pattern. Viapoints are detected from the observed walking trajectories using the minimum jerk criterion. The learning algorithm modulates the via-points as control actions to improve walking trajectories. This decision is based on a learned model of...
متن کاملBiped Balance Control by Reinforcement Learning
This work studied biped walking with single (one-leg) support and balance control using reinforcement learning. The proposed Q-learning algorithm makes a robot learn to walk without any previous knowledge of dynamics model. This balance control with single support shifts the Zero Moment Point (ZMP) of the robot to a stable region over walking sequences by means of learned gestures. Hence, the p...
متن کاملPath Planning for a Statically Stable Biped Robot Using PRM and Reinforcement Learning
In this paper path planning and obstacle avoidance for a statically stable biped robot using PRM and reinforcement learning is discussed. The main objective of the paper is to compare these two methods of path planning for applications involving a biped robot. The statically stable biped robot under consideration is a 4-degree of freedom walking robot that can follow any given trajectory on fla...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003